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Abstract. This paper reviews some results regarding symbolic 
dynamics, correspondence between languages of dynamical sys- 
tems and combinatorics. Sturmian sequences provide a pattern for 
investigation of one-dimensional systems, in particular interval ex- 
change transformation. Rauzy graphs language can express many 
important combinatorial and some dynamical properties. In this 
case combinatorial properties are considered as being generated by 
substitutional system, and dynamical properties are considered as 
criteria of superword being generated by interval exchange trans- 
formation. As a consequence, one can get a morphic word appear- 
ing in interval exchange transformation such that frequencies of 
letters are algebraic numbers of an arbitrary degree. 

Concerning multy dimensional systems, our main result is the 
following. Let P(n) be a polynomial, having an irrational coeffi- 
cient of the highest degree. A word w (w — {w n ), n £ Z) consists of 
a sequence of first binary numbers of {P(n)} i.e. w n — [2{P(n)}]. 
Denote the number of different subwords of w of length k by T(k) 

Theorem. There exists a polynomial Q{k), depending only on 
the power of the polynomial P, such that T{k) = Q(k) for suffi- 
ciently great k. 



1. Introduction 

Methods of symbolic dynamics are rather useful in the study of com- 
binatorial properties of words, investigation of problems of number 
theory and theory of dynamical systems. Let M be a compact metric 
space, U C M be its open subspace, / : M — > M be a homeomorphism 
of the compact into itself, and x G M be an initial point. It determines 
a sequence of points 

■r.fi-r) .r : (.r).... 

With the sequence of iterations, one can associate an infinite binary 
word 
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a, f^{x ) E U 

b, fW(x )#U 



which is called the evolution of point x . If / is invertible then nGZ, 
otherwise n 6 N. Symbolic dynamics investigates the interrelation 
between the properties of the dynamical system (M, /) and the com- 
binatorial properties of the word W n . For words over alphabets which 
comprise more symbols, several characteristic sets should be consid- 
ered: U\, . . . ,U n . Technical notions regarding combinatorics of words 
see in section 15.11 

Direct problem of symbolic dynamics consists in the description of 
properties of the word W, based on the information about dynamic 
system. Reverse problem consists in the description of (M, f,U,x), 
based on the information about W. 

We shall point out some facts, which are known from folklore. 

Minimality of the dynamical system corresponds with the uniform 
recurrence property (see section I5~T|) . 

Uniqueness of invariant measure corresponds with the following prop- 
erty. Let it be a subword of uniformly recurrent (we denote uniformly 
recurrent as u.r.) word W. For any subword u C W lets suppose up- 
per density of occurrence coincides with lower density. The invariant 
measure is unique, then. 

In what cases M is a torus and / : M — y M is its shift? It means that 
this dynamical system has discrete spectrum. Let W be a superword, 
obtained by this dynamics. Let T be shift operator. Then mismatch 
function between W and T n (W) satisfies following conditions: 

(1) There exists sequence {rii} such that p(T ni (W), W) — > 0. 

(2) There exists co-prime arbitrary large ni,nj from this sequence. 

We shall analyse these problems. We start from general construc- 
tions, regarding torus rotation questions, uniqueness of invariant mea- 
sure, minimality of dynamical system. 

The famous Sturmian sequences and some of their generalizations 
present situation of "combinatorial paradise" . It provides patterns for 
further investigation. Using language of Rauzy graphs we shall for- 
mulate criteria of a superword being generated by interval exchange 
transformation. Note that any billiard word with rational angles can 
be obtained via such transformations. (Number of directions of ball is 
finite and position of ball on the side together with its direction pro- 
vides phase point, and Phase space is union of some intervals.) On the 
other hand, using language of Rauzy schemes (obtained from Rauzy 
graphs by exchanging maximal sequences of vertices of ingoing and 
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outgoing degree 1 by arches) we get the criteria of the superword to be 
morphic. 

Concerning shifts of multydimensional torus, there is a beautiful the- 
ory of Rauzy fractals. For more complicated systems one needs the 
other patterns for investigation rather then provided by Sturmian se- 
quences. 

The rest of this paper is devoted to dynamic systems connected with 
unipotent transformations of torus. This subject was considered in [T§] . 
Issues related to the study of sequences, obtained by taking fractional 
part of values of a polynomial at integer points lead to the investiga- 
tion of such dynamic systems. This problems play an important role 
in theory of numbers, theory of information transfer and some other 
branches [HOE]. Note that unipotent transformation of T 2 has the 
same relation with circle shift as billiards with arbitrary angles to in- 
terval exchange transformations. Sequences appearing in such billiards 
are analysed in [27]. 

Inverse problems of symbolic dynamics related to the unipotent trans- 
formation of a torus were studied in paper [7J (Unfortunately it was 
published only in Russian). 

We have to point out the paper [TH] obtained independently from [7J. 
Let Q(X) be a real polynomial of degree d > 1 where the coefficient of 
X d is irrational. Define the difference operator Au n = u n+ \ — u n and 
its iterates A 2 = A o A, • • • , A d = A o A d_1 . The authors establish 
the theorem according to which the sequence (A d [Q(n)\) n >o takes its 
value on a 2 d -element alphabet and that 



a Vandermonde determinant. In particular, p(n) depends only on the 
degree d of the polynomial Q provided the coefficient of X d is irrational, 



In this situation one have to count the number of parts of torus 
division by images of hyperplane. And proof that points in different 
regions have different evolutions can be done just by dimension in- 
duction because this system provides more information, no theory of 
quasi-invariant sets and factor dynamics required. 

The main theorem of this section is 




where 



v(k dr --,k 1 )= n (kj - ki), 



l<i<j<d 



p 2 (n) = (n+l)(7i + 2)(n + 3)/6. 
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Theorem 1.1. There exists a polynomial Q(k) , depending only on the 
degree of the polynomial P, such that T{k) = Q(k) for all sufficiently 
large k. 

2. Sturmian sequences and their generalizations 

The Problems (both direct and inverse) related to the rotation of 
a circle bring to a class of words which are called Sturmian words. 
Sturmian words are infinite words over a binary alphabet which contain 
exactly n + 1 different subwords (factors) of length n for any n > 1. 

Sturmian words provide an example of correspondence between lan- 
guage of dynamical system and combinatorial properties of superwords. 
We shall formulate classical result: 

Theorem 2.1 (Equivalence theorem (|lQ],[38j).). Let W be an infinite 
recurrent word over the binary alphabet A = {a, b}. The following 
conditions are equivalent: 

(1) The word W is a Sturmian word, i. e., for any n > 1, the number 
of different subwords of length n that occur in W is equal to 



(2) The word is not periodic and is balanced, i.e., any two subwords 
u,v dW of the same length satisfy the inequality ||i>| — |w| a | < 
1, where \w\ a denotes the number of occurrences of symbol a in 
the word w. 

(3) The word W = (w n ) is a mechanical word with irrational a, 
which means that there exist an irrational a, x G [0,1], and 
interval U C S 1 , \U\ = a, such that the following condition 
holds: 



(4) Word W can be obtained as a limit of the sequence of finite 
words {uii}^, such that u>j+i can be obtained from Wi via sub- 
stitution of the following type a ki b — > b, a kt+1 b — )■ a or b kl a — y 
a, b h+1 a b. 

Iff sequence of these substitutions is periodic, then a is a 
quadratic irrational. 

Sturmian words can be considered as theoretical "paradise" and pat- 
tern for further investigations. There are several different ways of gen- 
eralizing Sturmian words. 

First, one can consider balanced words over an arbitrary alphabet. 
Balanced nonperiodic words over an n-letter alphabet were studied in 



T n (W) 



rc + 1. 




a, T a n (x ) E U 
b } T a n (x )?U 
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paper [33] and later in [35]. In the papers [10] and [TS] . a dynami- 
cal system that generates an arbitrary nonperiodic balanced word was 
constructed. 

Secondly, generalization may be formulated in terms of the com- 
plexity function. Complexity function Ty/(n) presents the number of 
different subwords of length n in the word W . Sturmian words satisfy 
the relation Ty/{n + 1) — T\y{n) = 1 for any n > 1. Natural general- 
izations of Sturmian words are words with minimal growth, i.e., words 
over a finite alphabet that satisfy the relation T w {n + 1) — T w {n) = 1 
for any n > k, where A; is a positive integer. Such words were described 
in terms of rotation of a circle in paper [15]. Note also that words 
whose growth function satisfies the relation lim^oo T(n)/n = 1 were 
studied in paper [16]. 

Words with complexity function Twin) = 2n + 1 were studied by 
P. Arnoux and G. Rauzy ([171 [22], S3]), words with growth function 
Ty/[n) = In + 1 were analysed by G. Rote [15]. Consideration of the 
general case of words with linear complexity function involves the study 
of words generated by interval exchange transformations. The problem 
of description of such words was posed by Rauzy [13]. Words with 
linear growth of the number of subwords were studied by V. Berthe, 
S. Ferenczi, Luca Q. Zamboni (|30j, [21]). They investigate combinato- 
rial sequences related with interval exchange transformations. See also 
works of P.Balazi, Z.Masakova, E.Pelantova ([20], [21], [22]). 

3. Interval exchange transformations 

Sturmian sequences can be obtained via specific rotation of unit 
circle. Interval exchange transformation generalizes circle rotation. 
G. Rauzy posed the question about description of words obtained by 
interval exchange transformation [43J. 

S.Ferenci and L. Zamboni [31] obtained the criteria of words gen- 
erated by interval exchange transformations with following condition: 
trajectory of every break point don't get on an any break point. In this 
case obviously a complexity function of the word is equal to T{n) = 
(k — l)n + 1. In fact, this is the answer to the Rauzy question. 

In the papers [HIEEE] words generated by general piecewise-continuous 
transformation of the interval were studied. This approach is quite 
different. The answer to this question is given in terms of the evolution 
of the labelled Rauzy graphs of the word W. The Rauzy graph of order 
k (the k-graph) of the word W is the directed graph whose vertices 
biuniquely correspond to the factors of length k of the word W and 
the vertex A is connected to vertex B by directed arc iff W has a 
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factor of length k + 1 such that its first k letters make the subword 
that corresponds to A and the last k symbols make the subword that 
corresponds to B. By the follower of the directed A;-graph G we call 
the directed graph Fol(G) constructed as follows: the vertices of graph 
Fol(G) are in one-to-one correspondence with the arcs of graph G and 
there exists an arc from vertex A to vertex B if and only if the head of 
the arc A in the graph G is at the notch end of B. The (k + l)-graph 
is a subgraph of the follower of the /c-graph; it results from the latter 
by removing some arcs. Vertices which are tails of (or heads of) at 
least two arcs correspond to special factors; vertices which are heads 
and tails of more than one arc correspond to bispecial factors. The 
sequence of the Rauzy fc-graphs constitutes the evolution of the Rauzy 
graphs of the word W. The Rauzy graph is said to be labelled if its 
arcs are assigned by letters I and r and some of its vertices (perhaps, 
none of them) are assigned by symbol "-" . The follower of the labelled 
Rauzy graph is the directed graph which is the follower of the latter 
(considered a Rauzy graph with the labelling neglected) and whose arcs 
are labelled according to the following rule: 

(1) Arcs that enter a branching vertex should be labelled by the 
same symbols as the arcs that enter any left successor of this 
vertex; 

(2) Arcs that go out of a branching vertex should be labelled by 
the same symbols as the arcs that go out of any right successor 
of this vertex; 

(3) If a vertex is labelled by symbol "-" , then all its right successors 
should also be labelled by symbol "-" . 

The evolution is said to be correct if, for all k > 1, the following 
conditions hold when passing from the /c-graph G fc to the (k + l)-graph 
Gk+i '■ 

( 1 ) Each vertex is an incident to at most two incoming and outgoing 
arcs; 

(2) If the graph contains no vertices corresponding to bispecial fac- 
tors, then G n+ \ coincides with the follower D(G n ); 

(3) If the vertex that corresponds to a bispecial factor is not labelled 
by symbol "-" , then the arcs that correspond to forbidden words 
are chosen among the pairs Ir and ri; 

(4) If the vertex is labelled by symbol then the arcs to be 
deleted should be chosen among the pairs // or rr. 

The evolution is said to be asymptotically correct if this condition is 
valid for all k beginning with a certain k = K. The oriented evolution 
of the graphs means that there are no vertices labelled by symbol "-" . 
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Theorem 3.1 ([TT1 19]). A uniformly recurrent word W 

(1) is generated by an interval exchange transformation if and only 
if the word is provided with the asymptotically correct evolution 
of the labelled Rauzy graphs; 

(2) is generated by an orientation-preserving interval exchange trans- 
formation if and only if the word is provided with the asymptot- 
ically correct oriented evolution of the labelled Rauzy graphs. 

The proof of this theorem consists of two stages. First proves that 
these conditions are sufficient for the word to be generated by a piecewise- 
continuous interval transformation. And the second step is to prove 
that the sets of uniformly recurrent words generated by piecewise- 
continuous interval transformations and by the interval exchange trans- 
formation are equivalent. In order to do it, there is introduced invariant 
measure which provides metric: measure of line segment is its length. 

4. Substitutional sequences 

Important class sequences are so-called substitutional sequences. These 
sequences are invariant under substitution. We refer to the paper [2S|. 
Fibonacci word is such an example. Lets consider substitution : — > 
001, 1 — >■ 01. From symbol one can get Fibonacci word as superword 
0°°(O). It is Sturmian. Tribonacci word r°°(0) = 01020100102010 can 
be generated by substitution r(0) = 01,r(l) = 02, r(2) = 0. 

In fact, in |44j G. Rauzy showed that the Tribonacci minimal subshift 
(the shift orbit closure of t) is a natural coding of a rotation on the 
2-dimensional torus T 2 ; i.e., is measure-theoretically conjugate to an 
exchange of three fractal domains on a compact set in R 2 . Each domain 
is translated by the same vector modulo a lattice. 

This is one of the most impressive results. It provides description 
of two dimensional spaces. Symbols correspond with the division of 
T 2 into fractals, called Rausy fractals. Theory of Rauzy fractals was 
generalized on the so-called Pisot substitutions. 

This type of dynamical systems provides rich structures with nice 
picture in multidimensional case. We have correspondence between 
language of arithmetics, dynamical systems and combinatorics. 

In other multidimensional systems one can get some information. 
Complexity of sequences related with multidimensional systems was 
studied in [TS] . 

4.1. language of substitutions and Rauzy graphs. Consider con- 
dition (4) in the theorem 12.11 Sturmian sequence can be obtained as 
a limit of very specific substitutions. If it is an invariant under some 
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substitution, then rotation number is quadratic irrationality. Similar 
fact is known for any rotation of circle. However, there exists substi- 
tutions such that eigenvalues of corresponding matrices are algebraic 
numbers of an arbitrary degree. 

The language of Rauzy graphs provides a bridge between combinato- 
rial and topological properties in problems regarding interval exchange 
transformations. Technique of Rauzy graphs is an important tool in 
combinatorics of words. 

Rauzy graph of Sturmian sequence has one incoming and one out- 
going branching vertice. When they coincide, (byspecial word appears) 
Gk+i can be obtained via choosing one of two possibilities, according to 
the theorem 13.11 This choice corresponds with the decomposition of a 
in chain fraction. If this choice is made in periodic way a is quadratic 
irrational. 

This fact can be generalized. Let W be a u.r. word. Suppose 
behavior of Rauzy graphs is periodic in the same sense, then W is 
equivalent to a superword invariant under some substitution. In order 
to formulate a theorem one should define what "periodicity of events" 
in Rauzy graphs means. 

Rauzy scheme of W is a sequence of graphs {Tj} such that every 
vertice of Tj is either outgoing or incoming branching vertice of some 
order, and corresponds to subword of W. Tj+i can be obtained from 
Tj via exchanging some pathes of length 2 passed through some vertice 
via arrows and deleting vertices which are not an endpoints of new 
arrows. Rauzy scheme is periodic if there exists k > such as for all 
sufficiently large i there is isomorphism between Tj and which fits 
with transmission from Tj to r i+1 . 

Theorem 4.1 (|23j). Uniformly recurrent superword is equivalent to 
image of some morphism of substitutional invariant sequence iff it has 
a periodic Rauzy scheme. 

Proof of the implication that if W has a periodic Rauzy scheme it 
is subtitutional is not so difficult. The main obstacle is the opposite 
direction. Substitution can be "bad" in the sense that image of let- 
ter cij can have a "parasite" inclusion of image of a,j. So this naive 
construction fails. We don't know explicit construction. 

Let call weight of vertice of Rauzy scheme as a length of corre- 
sponding word. In order to prove the theorem, one needs to estab- 
lish that if W is uniformly recurrent word stable under substitution, 
then ratios of weights of all Rauzy vertices are bounded. Then one can 
use J.Cassaigne result [8] saying that if W is uniformly recurrent and 
liminf Tw(n)/n < oo, then lim sup TV (n + 1) — T w {n) < oo. It follows 
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from the fact that the number of vertices in Rauzy scheme is bounded 
and gets existence of periodic Rauzy scheme from that. (Condition for 
morphic uniformly recurrent words lim inf 2V(n)/n < oo follows from 
results of Yu.Pritukin [IT] . 

Proof of ratios boundness is based on the following fact. If W is a 
morphic uniformly recurrent word then there exists a constant C{W) 
such that for any subword W u occurs in v for any v C W, \v\ > 
C(W) ■ \u\. In order to use it, one needs to construct sequence of 
Rauzy schemes in such a way, that pathes incompatible by inclusion 
correspond to the words with the same property. 

This theorem implies Vershik-Lifshic theorem jl^l IS] of periodicity 
of Bratelli diagrams of Markov compact corresponding to the substi- 
tutinal systems. 

A Bratteli diagram (V, E) is a countable collection V of finite vertex 
sets, V = {V n }™ =1 and a countable collection E of finite edge sets 
E = {E n \ c ^ =1) along with functions s : E n — >■ V^_i and r : E n — > V n 
such that (i) V = {z^o}, (ii) s : E n — » V n _i and r : E n — > V n are onto 
for all n. We view (V, E) as a directed graph where an edge e G E n 
connects the source vertex s(e) G V n to the range vertex r(e) G V n +i- 
Periodicity of Bratelli diagram means that for some k there exists a 
pair of mappings from V n to V n+ k and from E n to E n+ k preserving 
functions s and r. With Bratelli diagram one can associate topological 
dynamics. Details can be founded in |34|. 

The proof of the Vershik-Lifshic theorem uses straightforward con- 
struction. Consider image of <p( n \a) = ((p^ n ~ 2 ^)((p^ 2 \a)) for some letter 
a. It consists of blocks corresponding application of y?*™ -2 ) to the let- 
ters of (p^ (a) and also can be decomposed to the blocks corresponding 
application of {p^^ 1 ^ to the letters of <p(a). Finite sets forming Bratelli 
diagrams, consists of sequences of pairs (block, its position in bigger 
block), they corresponds some subwords of W. On these sets relations 
of being left and right neighbors inside bigger block can be naturally 
posed. The first and last occurrence in the bigger block needs special 
attention, because bigger block itself may have position and can be 
preceded or followed by another bigger block. Details can be found in 
HS1I1Z]. See also [3E]. 



5. Main constructions and definitions 

5.1. Complexity function, special factors, and uniformly re- 
current words. In this section we define the basic notions of combi- 
natorics of words. By L we denote a finite alphabet, i.e., a nonempty 
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set of elements (symbols). We use the notation A + for the set of all 
finite sequences of symbols or words. 

A finite word can always be uniquely represented in the form w = 
W\ • • • w n , where u>j G A, 1 < % < n. The number n is called the length 
of word w; it is denoted by \w\. 

The set A + of all finite words over A is a simple semigroup with 
concatenation as semigroup operation. 

If element A (the empty word) is included in the set of words, then 
this is actually the free monoid A* over A. By definition the length of 
the empty word is |A| =0. 

A word u is a subword (or factor) of a word w if there exist words 
p, q G A + such that w = puq. 

Denote the set of all factors (both finite and infinite) of a word W 
by F(W). Two infinite words W and V over alphabet A are said to be 
equivalent if F(W) = F(V). 

The beginning w b of the word w is a sequence 

x,f(x),...,fW(x),n G Z. 

We say that symbol a G A is a left (accordingly, right) extension of 
factor v if av (accordingly, va) belongs to F(W). A subword v is called 
a left (accordingly, right) special factor if it possesses at least two left 
(right) extensions. A subword v is said to be bispecial if it is both a 
left and right special factor at the same time. The number of different 
left (right) extensions of a subword is called the left (right) valence of 
this subword. 

A word W is said to be recurrent if each its factor occurs in infinitely 
many times (in the case of a doubly-infinite word, each factor occurs 
infinitely many times in both directions). A word W is said to be 
uniformly recurrent or (u.r word) if it is recurrent and, for each of 
its factor v, there exists a positive integer N(v) such that, for any 
subword u of length at least N(v) of the word W, factor v occurs in u 
subword. 

Below we formulate several theorems about u.r words, which will 
be needed later. The proof of these theorems can be found in mono- 
graph [6]. 

Theorem 5.1. The following two properties of an infinite word W are 
equivalent: 

a) For any k there exists N(k) such that any segment of length k of 
the word W occurs in any segment of length N(k) of the word W ; 



INVERSE PROBLEMS OF SYMBOLIC DYNAMICS 



11 



b) If all finite factors of a word V are at the same time finite factors 
of a word W , then all finite factors of the word W are also finite factors 
of the word V . 

Theorem 5.2. Let W be an infinite word. Then there exists a uni- 
formly recurrent word W all of whose factors are factors of W . 

One can consider the action of the shift operator r on the set of 
infinite words. The Hamming distance between words W\ and W2 is 
the quantity d{W\ 1 W2) = Xlnez ^n2~' n ', where A n = if symbols at 
the n-th positions of the words are the same and X n — 1, otherwise. 

An invariant subset is a subset of the set of all infinite words which 
is invariant under the action of r. A minimal closed invariant set, 
or briefly, m.c.i.s, is a closed (with respect to the Hamming metric 
introduced above) invariant subset which is nonempty and contains no 
closed invariant subsets except for itself and the empty subset. 

Theorem 5.3 (Properties of closed invariant sets). The following prop- 
erties of a superword W are equivalent: 

(1) W is a uniformly recurrent word; 

(2) The closed orbit ofW is minimal and is a m.c.i.s. 

Theorem 5.4. Let W be a uniformly recurrent nonperiodic infinite 
word. Then 

(1) All the words that are equivalent to W are u.r. words; the set 
of such words in uncountable; 

(2) There exist distinct u.r. words W\ 7^ W 2 which are equivalent 
to the given word and can be written as W\ = UV\, W2 = UV2, 
where U is a left-infinite word and V\ 7^ V 2 are right-infinite 
words. 

6. UNIPOTENT DYNAMICS ON A TORUS 

6.1. Essential evolution of points. Let f : M — > M be a continu- 
ous map on the space M and U C M be a subset. The starting point x 
determines a binary word w describing the evolution as above: w n — l, 
if f( n \x) G U and w n — if f^ n \x) £ U. We assume that U is an open 
set, mes(dU) = and M is a compact metric space. 

Definition 1. A finite wordv? is said to be an essential finite evolution 
of a point x* , if every neighborhood of a point x* contains an open set 
V , so that for all x G V holds v h x — v* . 

An infinite word w is said to be an essential (infinite) evolution of 
a point x* , if every initial subword is an essential finite evolution of a 
point x* . 
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The word "evolution" will further denote "essential evolution" . 

Proposition 6.1. Let v be a finite word, then the set of points with 
fixed finite evolution is closed (i.e. consists all its limit points) 

The similar statement holds for an infinite wordw. (The intersection 
of any family of closed sets is closed) 

We shall not use the next proposition although it has its own interest: 

Proposition 6.2. Let (M, /, U, x) be a dynamical system without closed 
invariant subsets, and different points have different evolutions. Let 
(M,s,U,x) be corresponding symbolical dynamical system, i.e. set of 
all superwords with Tikhonov topology. Then M is naturally isomor- 
phic to factor M by spaces consisting of sets of superwords which are 
essential evolution of one point from M, isomorphism induces natural 
isomorphisms of dynamical systems. 

6.2. Morphism of dynamics. 

Definition 2. A morphism of two dynamics G : (M 1 , fa) — > (M 2 ,/2) 
is a continuous map, such that the diagram 

M 1 -a» M 2 

hi 1/2 
Mi -a» M 2 

is commutative. 

The notions of epimorphism, monomorphism and isomorphism are 
defined in the natural way. 

The factor- dynamics is naturally defined on the quotient topology iff 
/ permutes the equivalence classes of the map /. Note that the inverse 
images of points under morphisms are closed. 

Definition 3. A set V is irreducible, if its closure is not an inverse 
image of a closed set under any morphism (except monomorphism) and 
reducible otherwise. 

Theorem 6.3. Let a set U be irreducible, then 

(1) Different points have different evolutions. 

(2) For any e > there exists N(e), such that every two words 
of length N(e), corresponding to the initial points on distance 
greater than e are different. 

Proof. 

(1) Classes of points with the same evolution are closed and the 
map / permutes them. 
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(2) The proof of this point follows from the proof for the previous 
one by contradiction and transition to limit. 

□ 

6.3. Quasi-invariant sets. 

Definition 4. A dynamics is said to be minimal if M does not contain 
closed invariant sets apart from M and and irreducible, if it does 
not contain proper Quasi-invariant sets. 

Definition 5. A closed set N is quasi- invariant if for every two points 
A and B and a convergent sequence f^ ni \A) — > C such that C E N for 
rii — > oo every limit point of the sequence f( ni \B) e N. 

Proposition 6.4. 

(1) Every closed invariant set is quasi-invariant. 

(2) A partition of quasi-invariant sets corresponds to each factor- 
dynamics and conversely. 

(3) The image of a quasi-invariant set is quasi-invariant. 

(4) If M is closed invariant set, than one quasi-invariant set uniquely 
determines the factor-dynamic. 

(5) The set of points with fixed evolution is quasi-invariant. 

Definition 6. A — B cloud (or 0-cloud) with center A, generated by 
point B is the set of conditional limit points f^ ni \B) under the condi- 
tion f^\A) ->■ A. 

A — B fc-cloud with center A, generated by point B is the closure of 
the union of set of conditional limit points f^ Hi \B) under the condition 
fM(A) ->■ A* where A* e A - B (k - 1). 

A — B /c-cloud with center A, generated by point B is the closure of 
the union of the set of A — B k-clouds, k G N. 

Note that A — B A;-cloud is closed. 

Proposition 6.5. a) The image of A — B k-cloud under the l-th iter- 
ation is f {l) (A) - f®(B)- clouds. 

b) If An ->• A, B n ^ B then p(A n - B n ,A- B) ->■ 

Denote A — I?-cloud by Iq, by Lj + i lets denote the closure of the union 
of all Ai — _Bj-clouds, for which Ai, Bi e Li. Assume Lab = [J Li. 
Factorization, generated by Lab is the weakest factorization, that glue 
points A and B. 
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6.4. Unipotent dynamics on a torus. Problems, connected with 
the study of the behavior of fractional parts of values of a polynomial at 
integer points, are in fact the classical problems of symbolic dynamics. 
Let P(n) be a polynomial of degree m + 1 with irrational coefficient 
a m+ i of the highest degree. Define a sequence of polynomials Pk{n) k = 
0, . . . , m in the following way: 

P m (n)=P(n), 
P m -i(n) = P m (n + 1) - P m (n), 

< 

P^ 1 (n) = P l (n + l)-P l (n) } 

From these formulas it follows, that Po(n) = n!a m+1 is irrational. 
Put £ = Po(n), Xiin) = {Pi(n)} and x-(n) = Xi{n + 1), then from ([T]) 
we obtain the following dynamical system: 



(1) 



x' m = x m + x m -i mod 1 
x' m _ t = x m -i + x m _ 2 mod 1 

x\ = x\ + e mod 1, 



where e is irrational as e = n\a m+ i. The condition [2{P(n)}] = 
turns to the condition < x m {n) < 1/2. 

Consequently the vector (x[, . . . , x' m ) is obtained from ( 
by unipotent transformation (transformation, corresponding to a linear 
transformation with unitary eigenvalues). 

Images of hyperplanes x m = and x m = 1/2 divide the space on 
polyhedra. The same word of length k corresponds with the points of 
the same polyhedron. 

6.5. Mismatch function. 

Definition 7. A function p is said to be a mismatch function of words 
w and v and is defined in the following way: 



0, if Wi = Vi 

1, if Wi ^ vi 



A density of mismatch of p(w, v) of words w and v is defined by the 
formula 



p(w, v) = lim 
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Theorem 6.6. Let w v be two different evolutions of the point xq G T. 
Then p(w, v) = 0. 

Proof. From lemma Wail [5] it follows, that the orbit of any point 
Xq G T is everywhere dense and evenly distributed. From the continuity 
of map /, the condition mes(dU) = and the definition of evolution 
it follows, that w n = v n if f^ n \xo) dU. The proof of the theorem 
follows from these statements. □ 

Theorem 6.7. Let points x and x* be different and have different 
evolutions. Then the density of mismatch p(w x ,w x *) is defined and is 
greater than 0. 

Proof. We can assume that words w x and w x * differ in the first position. 
Consider the direct product T xT. We divide the set of points into two 
classes: with the same current position and different current position. 
Let O C T x T be the set of different pairs, then the orbit of pair 
(x, x*) lies in O. Closed orbit of any pair is a torus, a minimal closed 
invariant set, on which the dynamics on torus is realized. Let p be the 
volume of the intersection, then p ^ and p is the density. 

□ 

6.6. Description of torus factor-dynamics. We'll consider the dy- 
namics that don't glue the coordinate x n . Consider the k-th iteration 
of the transformation of torus: 



(2) 



x 



(k) 



+ ... + C^e modi 



xf ) = Xi + C\xi-i + • • • + C l k e mod 1 



Xi — xi + C\e mod 1 



If the points A (xi, . . . , x m ) and B (xi + Axi, . . . , x m + Ax m ) belong 
to the set M', then f^ k \A) and f^ k \B) also belong to the same set. 
From (T5D it follows: 



' Ax^ ) = Ax m + C£Ax m _! + . . . + C^Axi mod 1 
(3) <J Axf ] = Ax, + C l k Axi^ + ... + C^Axx mod 1 

Ax^ = Ax i mod 1 

Proposition 6.8. Closed orbit of any pair of points is a torus T' , a 
minimal closed invariant set, on which the dynamics of torus is realized. 
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(4X 



Remark 1. For two-dimensional case there exists an area U with ana- 
lytic boundary and a starting point x* that have infinitely many different 
evolutions, that differ in an infinite number of positions. 

Proposition 6.9. Let 1, e, Axi be linearly independent over Q. Then 
A—B cloud contains all points which first i — 1 coincide with coordinates 
of point B. 

Proof. We may assume that Axj is rational when j < i and let Axj = 
Pj/qj be the presentation Axj as an irreducible fraction and k is divis- 
ible by the product m\ YYj=i Qji then Xj = Xj when j < i and systems 
(J2D and ([3]) can be rewritten as 

Xm = Xm + + • • • + C^E mod 1 

xf 1 ' = xj + C\ x x^x + • • • + C J kl e mod 1 

x^ = x\ + C\ x e mod 1 
Aa?S° = Ax m + CfaAxm-i + ■■■ + C^Axi mod 1 

Axf ) = Ax, + ChAxj-t + ... + C{f l Ax 1 mod l(j > i) 
Axj = Axj mod l(j < i) 

Vector (x^ l \ Xm\Ax[ kl \ Ax^) by lemma Wail j5] is 

everywhere dense in the torus of dimension 2m — i + 1, then from 
definition of A — B cloud it follows that it contains all points, which 
first i — 1 coordinates coincide with coordinates of the point B, and 
other points can be chosen arbitrarily. □ 

Proposition 6.10. Let Axi be an irrational number. Then there exists 
a point B n , the evolution of which coincide with evolution of points A 
and B, and for which it holds: 

Axf" = nAxf. 

Proof. Lets choose points A and B as B and Bi respectively. By 
the method of mathematical induction we'll assume that point Bj, is 
already built, and as for -B^+i it suffices to take any conditionally limit 
point of a sequence f {ni) {B k ), under the condition f rH \B k ^) ->- B k . 
Note, that the point B k G A — B fc-cloud. 

□ 



Proposition 6.11. Let Axi be an irrational number. Then there exists 
a point Bs, the evolution of which coincide with evolutions of points A 
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and B, for which it holds: 

Axf s =6,0<8<1. 

This fact follows directly from the proposition 16. 101 and the fact that 
the set of point with fixed evolution is closed. 

Thus the case when Axi is irrational reduces to the case when 1, e, 
L\Xi are linearly independent over Q. 

The case when all Axj are rational leads to factor-dynamics where 
i edge of torus divides in Mj = YYj=i m i P ar t s ) where mj are arbitrary 
natural numbers and points of the torus x = x* are identified when for 
all 1 < j < m holds: MjXj = Mjx*. 

Description of Quasi- invariant sets follows from the proposition 16.111 

Theorem 6.12 (Description of Quasi- invariant sets). Quasi-invariant 
set is a shift of abelian group which transforms into themselves under 
translations by 1/Mj along i-coordinate or under all translations along 
coordinates with the number, greater than some fixed number. 

The theorem provides description of all possible factor-dynamics. 

Corollary 6.13. The class of closed reducible sets consists of sets, 
which transform into themselves under translations by 1 /Mj along i- 
coordinate or under all translations along coordinates with the number, 
greater than some fixed number. The set < x m < 1/2 is irreducible. 

6.7. Proof of the theorem 11.11 This theorem follows from the next 
proposition: 

Proposition 6.14. Consider the dynamics of the torus, given by equa- 
tion |D) and let irreducible set U is given by the condition: 

0<x m < 1/2, 

then there exists such L(e) that points with the same finite evolution 
of length L(e) divide torus on closed convex polyhedra, and different 
polyhedra correspond with the different evolutions. 

Proof. The set U is irreducible, so by theorem 16.31 there exists 5' such 
that N(5') evolutions of points at a distance greater than 5' are dif- 
ferent. It is obvious, that iV-evolutions of all interior points of the 
polyhedron, obtained after 2-th iteration are the same. We call it the 
evolution of the polyhedron. 

Consider parts of the partition, corresponding to words of length 
N(S'), then each polyhedron of the partition can't intersect more than 
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one hyperface from n + 1 family of planes f n [x\ — q/2, q G N, other- 
wise we can choose two points, that belong to this polyhedron and are 
divided by two planes from n + 1 family. 

Assuming n + 1 family to be the first and turning the time back we 
get the situation when N(S') evolution of these points doesn't allow to 
distinguish them, and this is impossible. Polyhedra formed by n + 1 
family of planes with the same evolution can't have common points. 
Let S* be the minimal distance between polyhedra with the same evolu- 
tions, then by putting L{e) = m&x(N(5'), N(S*)) we obtain the num- 
ber of iterations, from which certainly achieve the equality between 
number of words of length k and the number of polyhedra in which 
the torus divides k families of planes f l [xi = q/2, q = 0, . . . , k — 1. 
When (e) is irrational the intersection of more than m hyperplanes is 
empty and there exists a one-to-one correspondence between points of 
intersection of m non-parallel planes and polyhedra of partition. By 
computing the number of points in the intersection of k families of 
planes f l [xi = q/2, q = 0, . . . , k — 1 we obtain a polynomial, which 
defines the number of parts and consequently the number of subwords 
from some moment. □ 

The number of points of the intersection of hyperplanes Q(k) and 
consequently the number of subwords T(k) (k > K) of length k can be 
calculated by the formula: 



Q(k) = £ 



0<fci<...<fc m <fc 





i (Y) • 


/ km \ 

\ m ) 


a nn\ rn(m+l) 
, degQ(k) = 


<k 


i : 


■■ ( kl ) 





For every K there exists P and k > K such that the equality 
T(k) = Q(k) holds for k ^ k and doesn't hold for k . 
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